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Abstract 

Summarizing topological information from datasets and maps defined on them is 
a central theme in topological data analysis. Mapper, a tool for such summarization, 
takes as input both a possibly high dimensional dataset and a map defined on the data, 
and produces a summary of the data by using a cover of the codomain of the map. 
This cover, via a pullback operation to the domain, produces a simplicial complex 
connecting the data points. 

The resulting view of the data through a cover of the codomain offers flexibility in 
analyzing the data. However, it offers only a view at a fixed scale at which the cover 
is constructed. Inspired by the concept, we explore a notion of a tower of covers which 
induces a tower of simplicial complexes connected by simplicial maps, which we call 
multiscale mapper. We study the resulting structure, its stability, and design practical 
algorithms to compute its persistence diagrams efficiently. Specifically, when the do¬ 
main is a simplicial complex and the map is a real-valued piecewise-linear function, the 
algorithm can compute the exact persistence diagram only from the 1-skeleton of the 
input complex. For general maps, we present a combinatorial version of the algorithm 
that acts only on vertex sets connected by the 1-skeleton graph, and this algorithm 
approximates the exact persistence diagram thanks to a stability result that we show 
to hold. We also relate the multiscale mapper with the Cech complexes arising from a 
natural pullback pseudometric defined on the input domain. 
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1 Introduction 

Recent years have witnessed signihcant progress in applying topological ideas to analyzing 
complex and diverse data. Topological ideas can be particularly powerful in deriving a 
succinct and meaningful summary of input data. For example, the theory of persistent 
homology built upon [12112312S] and other fundamental developments [31IH El 113 IHl 113113 
EZ], has provided a powerful and flexible framework for summarizing information of an input 
space or a scalar held into a much simpler object called the persistence diagram/barcode. 

Modern data can be complex both in terms of the domain where they come from and 
in terms of properties/observations associated with them which are often modeled as maps. 
For example, we can have a set of patients, where each patient is associated with multiple 
biological markers, giving rise to a multivariate map from the space of patients to an image 
domain that may or may not be the Euclidean space. To this end, we need to develop 
theoretically justihed methods to analyze not only real-valued scalar helds, but also more 
complex maps dehned on a given domain, such as multivariate, circle valued, sphere valued 
maps, etc. 
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There has been interesting work in this direction, including multidimensional persistence 
m and persistent homology for circular valued maps [3] . However, summarizing multivari¬ 
ate maps using these techniques appears to be challenging. Our approach takes a different 
direction, and is inspired by and based on the mapper methodology, recently introduced 
by Singh et al. in [26]. Taking an observation made in [26] regarding the behavior of 
Mapper under a change in the covers as a starting point, we study a multiscale version of 
mapper, which we will henceforth refer to as multiscale mapper, that is capable of producing 
a multiscale summary using a cover of the codomain at different scales. 

Given a map f : X ^ Z, Singh et al. proposed a novel concept to create a topological 
metaphor, called the mapper, for the structure behind / by pulling back a cover of the space 
Z to 8a cover on X through /. This mapper methodology is general: it can work with any 
(reasonably tame) continuous maps between two topological spaces, and it converts complex 
maps and covers of the target space, into simplicial complexes, which are much easier to 
manipulate computationally. It is also powerful and flexible- one can view the map / and 
a hnite cover of the space Z as the lens through which the input data X is examined. By 
choosing different maps and covers, the resulting mapper representation captures different 
aspects of the input data. Indeed, the mapper methodology has been successfully applied 
to analyzing various types of data, see e.g. [saiMi, and it is a main component behind the 
data analytics platform developed by the company Ayasdi. 

1.1 Contributions 

Given an input map f : X ^ Z and a hnite cover U of Z, the induced mapper M(U, /) 
is a simplicial complex encoding the structure of / through the lens of Z. However, the 
simplicial complex M(U, f) provides only one snapshot of X at a fixed scale as determined 
by the scale of the cover U. Using the idea of persistence homology, we study the evolution 
of the mapper M(/, for a tower of covers il = at multiple scales (indexed by e). 

As an intuitive example, consider a real-valued function / : X —)■ M, and a cover of M 
consisting of all possible intervals of length £. Intuitively, as £ tends to 0, the corresponding 
Mapper M{f,Us) approaches the Reeb graph of /. As £ increases, we look at the Reeb graph 
at coarser and coarser resolution. The multiscale mapper in this case roughly encodes this 
simplihcation process. 

The multiscale mapper MM(il, /), which we formally dehne in ^ consists of a sequence 
of simplicial complexes connected with simplicial maps. Upon passing to homology with 
helds coefficients, the information in MM(il, /) can be further summarized in the corre¬ 
sponding persistence diagram. In other words, we can now summarize an input described 
by a multivariate (or circle/sphere valued) map into a single persistence diagram, much like 
in traditional persistence homology for real-valued functions. 

In ^ we discuss the stability of the multiscale mapper, under changes in the input map 
and/or in the tower of covers il. Stability is a highly desirable property for a summary as 
it implies robustness to noise in data and in measurements. Interestingly, analogous to the 
case of homology versus persistence homology, mapper does not satisfy a stability property, 
whereas multiscale mapper does enjoy stability as we show in this paper. 

To facilitate the broader usage of mapper and multiscale mapper as a data analysis tool, 
we develop efficient algorithms for computing and approximating mapper and multiscale 
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mapper. In particular, even for piecewise-linear functions defined on a simplicial complex, 
the standard algorithm needs to determine for each simplex the subset (partial simplex) on 
which the function value falls within a certain range. 

In ^ we show that for such an input, it is sufficient to consider the restriction of the 
function to the 1-skeleton of the complex for computing the mapper and the multiscale 
mapper. Since the 1-skeleton (a graph) is typically much smaller in size than the full complex, 
this helps improving the time efficiency of computing the mapper and multiscale mapper 
outputs. 

In ^ we consider the more general case of a map f : X ^ Z where X is a simpli¬ 
cial complex but Z is not necessarily real-valued. We show that there is an even simpler 
combinatorial version of the multiscale mapper, which only acts on vertex sets of X with 
connectivity given by the 1-skeleton graph of X Q The cost we pay here is that the result¬ 
ing persistence diagram approximates (instead of computing exactly) that of the standard 
multiscale mapper, and the tower of covers of Z needs to satisfy a “goodness” condition. 

In ^ we show that given a tower of covers it and a map / : X —)■ Z there exists a natural 
pull-back pseudo-metric dehned on the input domain X. With such a pseudo-metric on 
X, we can now construct the standard Cech hltration € = {Ceche(X)}£ (or Rips hltration) 
in X directly, instead of computing the Nerve complex of the pull-back covers as required 
by mapper. The resulting hltration € is connected by inclusion maps instead of simplicial 
maps. This is easier for computational purposes even though one has a method to compute 
the persistence diagram of a hltration involving arbitrary simplicial maps [T5j. Furthermore, 
it turns out that the resulting sequence of Cech complexes £ interleaves with the sequence of 
complexes MM(il,/), implying that their corresponding persistence diagrams approximate 
each other. 

Some technical details and proofs are relegated to the Appendix. 


2 Topological background and motivation 

In this section we recall several facts about topological spaces and simplicial complexes [23] . 
Let K and L be two hnite simplicial complexes over the vertex sets Vk and Vl, respectively. 
A set map 0 : t Vl is a simplicial map if 0(cr) G L for all a E K. 

By an open cover of a topological space X we mean a collection U = {Ua}a£A of open 
sets such that IJaeA = -A. In this paper, whenever referring to an open cover, we will 
always assume that each Ua is path connected. 

Definition 1 (Nerve of a cover). Given a finite cover U = {Ua}oL&A of a topological space 
X, we define the nerve of the cover U to he the simplicial complex N{U) whose vertex set is 
the index set A, and where a subset {oq, • • •, «A:} ^ A spans a k-simplex in N{U) if and 
only if t/ao n 17a, n ... n Uaf, 0. 

Suppose that we are given a topological space X equipped with a continuous map f : X ^ Z 
into a parameter space Z, where Z is equipped with an open cover U = {Ua}aeA for some 
hnite index set A. Since / is continuous, the sets {/“^(17 q), a E A} form an open cover of 

^We note that a variant of this combinatorial version is what is currently used in the publicly available 
software for mapper in practice. 


4 



X. For each a, we can now consider the decomposition of f~^{Ua) into its path connected 
components, so we write f~^{Ua) = U£i ^,*5 where ja is the number of path connected 
components Va/s in f~^{Ua)- We write f*{U) for the cover of X obtained this way from 
the cover U of Z and refer to it as the pullback cover of X induced by U via /. 

Notice that there are pathological examples of / where f~^{Uo,) may shatter into inhnitely 
many path components. This motivates us to consider well-behaved functions /: we require 
that for every path connected open set U ^ Z, the preimage f~^{U) has finitely many path 
connected components. An example of well-behaved functions is given by piecewise-linear 
real valued functions dehned on hnite simplicial complexes. It follows that for any well- 
behaved function / : X —)■ Z and any hnite open cover U of Z, the open cover f*{U) is also 
hnite. 


If not stated otherwise, all functions and all covers are assumed to be well-behaved 
and finite, respectively. Consequently, all nerves of pullbacks of finite covers will 
also be finite. 

Definition 2 (Mapper [26]). Let X and Z be topological spaces and let f : X ^ Z be 
a well-behaved and continuous map. Let U = {Lfa}a&A be a finite open cover of Z. The 
mapper construction arising from these data is defined to be the nerve simplicial complex of 
the pullback cover: M(W,/) := N{f*{U)). 

Remark 2.1 (For Dehnitionj^. This construction is quite general. It encompasses both the 
Reeb graph and merge trees at once: consider X a topological space and / : X —)■ M. Then, 
consider the following two options for U = {Ua}a&A, the other ingredient of the construction: 

• Ua = (—oo, a) for a G A = M. This corresponds to sublevel sets which in turn lead to 
merge trees. 

• Ua = {a — e,a -\- e) for a G A = M, for some fixed £ > 0. This corresponds to (e-thick) 
level sets, which induce a relaxed notion of Reeb graphs. 

In these two examples, for simplicity of presentation, the set A is allowed to have infinite 
cardinality. Also, note one can take any open cover of M in this definition. This may give 
rise to other constructions beyond merge trees or Reeb graphs. For instance, one may choose 
any point r G M and let Ua = {r — a,r + a) for each a G A = M or other constructions. 

Maps between covers. 

If we have two covers U = {Ua}aeA and V = {Vg}y^B of a space X, a map of covers from U 
to V is a set map : A ^ B so that Ua F V^(^a) for all a G A. By an abuse of notation we also 
use f to indicate the map W —> V. Given such a map of covers, there is an induced simplicial 
map X(^) : N{IA) — >■ X(V), given on vertices by the map f. Furthermore, if W V W 
are three diherent covers of a topological space with the intervening maps of covers between 
them, then N{( o ff) = N{C) o N{(f) as well. 

The following simple lemma will be very useful later on. 

Lemma 2.1 (Induced maps are contiguous). Let (,f:LL^Vbe any two maps of covers. 
Then, the simplicial maps N{() and are contiguous. 
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Recall that two simplicial maps hi, h 2 : K ^ L are contiguous if for all cr G -R it holds 
that hi((T)Uh2(cr) G L. In particular, contiguous maps induce identical maps at the homology 
level [23]. Lemma 2.1 implies that the map H^{N{U)) —)■ if*(iV(V)) thus induced can be 
deemed canonical. 


Proof of lemma 2.1 


both 


Write U 


= {Ua}a&A and V = Then, for all a G A we have 

Ua ^ and U(^ ^ 


This means that Ua C n V^(q) for all a ^ A. Now take any o G N{U). We need to 
prove that C(o') G iV(V). For this write 


/3eC(a')U^(cr) V«ecr / \a&cr ) 

= Pi (^C(o) ^5(«)) 

aGcr 

2 n f'o ^ 

aGcr 

where the last step follows from assuming that a G N{U). □ 

Pullbacks. 

When we consider a space X equipped with a continuous map / : X —)■ Z to a topological 
space Z, and we are given a map of covers f \ U ^ V between covers of Z, there is a 
corresponding map of covers between the respective pullback covers of X: f*{f) : f*{U) —> 

r(v). 

Indeed, we only need to note that if f/ C R, then f~^{U) C f~^{V), and therefore 
it is clear that each path connected component of f~^{U) is included in exactly one path 
connected component of f~^{V). More precisely, let U = {Ua}aeA, V = with 

Ua C V^(q,) for a E A. Let Ua,i, f G {1,..., Uq] denote the connected components of f~^{Ua) 
and Vgj, j G {1,... ,my 3 } denote the connected components of /“^(V^). Then, the map of 
covers f*{f) from f*{U) to /*(V) is given by requiring that each set Ua,i is sent to the unique 
set of the form V^(^a),j so that Ua,i C V^(a)j. 

Furthermore, if W V W are three different covers of a topological space with the 
intervening maps of covers between them, then /*(C ° 0 — f*iO ° f*{0- 


3 Multiscale Mapper 

In the dehnition below, objects can be covers, simplicial complexes, or vector spaces. 

Definition 3 (Tower). A tower OR with resolution r G M is any collection 2IJ = of 

objects We together with maps We,£' : W^ —)■ W^/ so that = id and ow^^e' = Ws,£" for 
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, g./ . 

all r < e < e' < e”. Sometimes we write Q2J = {We —^ denote the collection 

with the maps. Given such a tower QH, res(Q2J) refers to its resolution. 

When W is a collection of finite covers equipped with maps of covers between them, we 
call it a tower of covers. When W is a collection of finite simplicial complexes equipped with 
simplicial maps between them, we call it a tower of simplicial complexes. 

The notion of resolntion, and the variable e, intnitively specify the grannlarity of the 
covers and the simplicial complexes indnced by them. 

The pnllback properties described at the end of ^ make it possible to take the pnllback 
of a given tower of covers of a space via a given continnons fnnction into another space, so 
that we obtain: 

Proposition 3.1. Let id be a tower of covers of Z and f : X ^ Z be a continuous function. 
Then, /*(il) is a tower of covers of X. 

In general, given a tower of covers QU of a space X, the nerve of each cover in QU together 
with simplicial maps induced by each map of QU provides a tower of simplicial complexes 
which we denote by iV(QU). 

Definition 4 (Multiscale Mapper). Let X and Z be topological spaces and f : X ^ Z be a 
continuous map. Let d be a tower of covers of Z. Then, the multiscale mapper is defined 
to be the tower of simplicial complexes defined by the nerve of the pullback: 

MM(il,/) :=7V(r(il)). 

Consider for example a sequence res(il) < Si < 62 < ■ ■ ■ < Sn oi n distinct real numbers. 
Then, the dehnition of multiscale mapper MM (it, /) gives rise to the following: 

N(nw)) ^ N(r(u„)) ^^ N{r{u,j) ( 3 . 1 ) 

which is a sequence of simplicial complexes connected by simplicial maps. 

Applying to them the homology functor Hfc(-), A: = 0,1, 2,..., with coefficients in a held, 
one obtains persistence modules mi: tower of vector spaces connected by linear maps for 
which efficient persistence algorithm is known jl5j : 

Hi(Af(r(M..))) ^ ... ^ H,(/V(/-(M,J)). (3.2) 

More importantly, the information contained in a persistence module can be summarized 
by simple descriptors: its associated persistence diagrams. As pointed out in [6] , a hniteness 
condition is required, but given our assumptions that the covers are hnite and that the 
function / is well-behaved we obtain that the homology groups of all nerves have hnite 
dimensions. Now one can summarize the persistence module with a hnite persistent diagram 
for the sequence MM(il,/), denoted by DkMM(il,/) for each dimension k eN (see [I7| for 
background on persistence diagrams). 
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4 Stability 


To be useful in practice, the multiscale mapper should be stable against the perturbations 
in the maps and the covers, we show that such a stability is enjoyed by the multiscale 
mapper under some natural condition on the tower of covers. The notion of stability in the 
context of topological data analysis has been recently studied by many researchers, see e.g. 
[H El El m El]- In particular, Cohen-Steiner et ah [H] showed that persistence diagrams 
are stable in the bottleneck distance meaning that small perturbations to a map yield small 
variations in the computed persistence diagrams. In [6], stability is expressed directly at the 


(algebraic) level of persistence modules (3.2) via a quantitative structural condition called 


interleaving of pairs of persistence modules. Before we state our stability results, we identify 
compatible notions of interleaving for tower of covers and tower of simplicial complexes, as 
a way to measure the “closeness” between two tower of covers (or two tower of simplicial 
complexes). 


4.1 Interleaving of towers of covers and simplicial complexes 

In this section we consider towers of covers and simplicial complexes indexed over M. In 
practice, we often have towers indexed by a discrete set in M. Any such tower can be 
extended to a tower of covers (or simplicial complexes) indexed over M by taking the covers 
for any intermediate open interval {e, e') C M to be same as that at e G M. 

Definition 5 (Interleaving of towers of covers). Let it = {We} and QJ = {Ve} be two towers 
of covers of a topological space X such that res(it) = res(QJ) = r. Given rj >0, we say that it 
and QJ are rj-interleaved if one can find maps of covers Q '-hie ^ H+j? is' '■ 
for all s, s' > r. 

Proposition 4.1. (i) If IX and are rji-interleaved and QJ and QU are ri 2 -interleaved, then, 
it and on are (r/i + 7 ]2)-interleaved, (ii) Let f : X ^ Z be a continuous function and it and 
be two rj-interleaved tower of covers of Z. Then, /*(it) and f*(fU) are also rj-interleaved. 

Definition 6 (Interleaving of simplicial towers). Let © = {iSg and T = 

{% H Te^ be two towers of simplicial complexes where res(©) = res(T) = r. We say 

that they are rj > 0 interleaved if for each e > r one can find simplicial maps —)■ 7}+,, 

and fje ■ Te ^ <Se+ri SO that: 

(i) for all e >r, fje+ri ° Ts o,nd Se,£+ 2 ri o,re contiguous, 

(ii) for all e >r, (fe+ri ° 'f’e o,nd te,e+ 2 ?? are contiguous, 

(Hi) for all s' > e >r, o Se,e' and fe+r),£'+?? o Te are contiguous, 

(iv) for all s' > e >r, s,£^ri,e'+r) ° f’e and ° is,s' are contiguous. 

These four conditions are summarized by requiring that the four diagrams below commute 
up to contiguity: 



^£,£ + 2t7 


(4.3) 


5. 


5, 


5, 


^£+2?? ^e+r] 

‘■Pe '0£+Tj '0e P^e+ri 


% 


£+ 5 ? 


% 


ie,e+2r} 


■Te 


e+2rj 


5. 


5,. 




Z 


e+r] 


~, ' ^ Ze'+r] 

''e+i7,£'+7) 


5, 


®£ + 7J,e'+77 


£+ri 




'e'+rj 






Te 


Proposition 4.2. Let il and QJ be two rj-interleaved towers of eovers of X with res(il) = 
res(2J). Then, iV(il) and iV(2J) are also rj-interleaved. 

r e' 'i 

Proof. Let r denote the common resolution of il and 2J. Write il = \U^ hie ']and 

QJ = {Ve '^e']r<e<e'^ £ > r let Ce : “t Ve+r) and 14^+^ be given 

as in Definition 1^ For each diagram in (4.3) one can consider a similar diagram at the level 
of covers involving covers of the form Ue and Ve, and apply the nerve construction. This 


operation will yield diagrams identical to those in (4.3) where for each e > r: 

. Sy.= N{y),re:=N{Ve), 

• Se,e> ■= N{ue,e>), for r < £ < s'] te,e' ■= N{ve,e’), for r < £ < £'; ip^ '.= N{Z), and 

V'S := N{y. 

To satisfy De£nition|^ it remains to verify conditions (i) to (iv). We only verify (i), since 
the proof of the others follows the same arguments. For this, notice that both the composite 


map fe+ri o Ce and Ue,e+ 2 ri are maps of covers from 14^ to I4e+2r)- By Lemma 2.1 we then have 
that and N{us,e+ 2 ri) = Se,e+ 2 ri are contiguous. But, by the properties of the nerve 

construction A^(^e+^ o Q) = o N{Q) = 'ipe+ri ° Pei which completes the claim. □ 

From now on, for a hnite tower of simplicial complexes © and fc G N, we denote by 
DfcG the fc-th persistence diagram of G with coefficients in a fixed field. Notice that, since 
contiguous maps induce identity maps at the homology level, applying the simplicial ho¬ 


mology functor with coefficients in a field to diagrams such as those in (4.3) yields two 


persistence modules strongly interleaved in the sense of [6] . Thus, we have a stability result 
for DfcMM(il, /) when / is kept fixed but the tower of covers il is perturbed. 

Corollary 4.3. For rj > 0, let il and QJ be two finite towers of covers of Z with res(il) = 
res(2J) > 0. Let f : X ^ Z be well-behaved and il and be rj-interleaved. Then, MM(il, /) 
and MM(QJ, /) are rj-interleaved. In particular, the bottleneck distance between the persis¬ 
tence diagrams DfcMM(il, /) and DfcMM(QJ, /) is at most rj for all k gN. 
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4.2 Stability of Multiscale Mapper 

The fixed simplicial complex produced by the standard mapper may not admit a simple 
notion of stability. We elaborate this point by an example in Appendix In this section, 
we show that the multiscale mapper on the other hand exhibits a stability property against 
perturbations both in functions and in tower of covers (the latter of which already discussed 
in Corollary |4.3[ ). However, to guarantee stability against changes in functions, it is necessary 
to restrict the multiscale mapper to a special class of towers of covers, called (c,s)-good tower 
of covers. We justify the necessity of considering such (c, s)-good tower of covers in Appendix 

EH 

In what follows, we assume that the target compact topological space Z is endowed with 
a metric dz- For a subset O G Z, let diam((9) denote its diameter, that is, the number 
supz,z'&odz{z, z'). For <5 > 0, let denote the set {z G Z\ dz{z,0) < 5}. 

Definition 7 (Good tower of covers). Let c > 1 and s > 0. We say that a finite tower of 
covers 2IJ = {Wg for the compact metric space {Z,dz) is {c,s)-good if: 

1. res(QII) = s, and s < diam(Z); 

2. diam(fF) < e for all W G We and all e > s; and 

3. for all O C Z with diam(O) > s, there exists W G Wc.diam(o) such that hF ^ O. 


Intuitively, in a (c, s)-good tower of covers, the parameter e of a cover We G 2IJ is a scale 
parameter, which specifies an upper-bound for the size / resolution of cover elements in We- 
Condition-3 in the above definition requires that any set O G Z should be covered by an 
element from a cover of Q2J whose resolution is comparable to the size (diameter) of O. See 
Appendix for more discussions on the (c, s)-good tower of covers and its properties. In 

we how that such a tower of covers is somewhat necessary to 
The (c, s)-good tower of covers is also computationally feasible. 


particular, in Appendix p-l 
obtain a stability result 


In Appendix |B.2| we provide an example of a construction of a tower of covers for compact 
metric spaces with bounded doubling dimension using £-nets. 

A priori the conditions defining a (c, s)-good tower of covers do not guarantee that the 
sets O in Z with diameter smaller than the resolution s can be covered by some element W 
of some We, for some e > s. The following proposition deals with this situation and is used 
later. 


Proposition 4.4. If Z is path connected and ii is a {c,s)-good tower of covers of Z, then 
whenever O G Z is s.t. diam(O) < s, there exists W G Wc.(diam(o)+2s) s.t. W D O. 

Remark 4.1. Note that for a small set O , diam(O) < s, one obtains only that O G W for 
some W in Wc.(diam(o)+ 2 s)- This is in contrast with what property 3 above guarantees for sets 
with diameter diam(O) > s: the existence ofW& Wc.diam(o) with O G W. 

^This condition is related to the concept of Lebesgue number for covers of metric spaces where metric 
balls instead of sets with bounded diameter are used. 
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Proof. Indeed, since Z is path connected then one has that diani(0*) > niin{diani(Z), s} = 
s it follows by the dehnition that 3W' G Wc.diam( 0 '>) containing O. We conclude since 
diani(O^) < diani(O) + 2s, O C O®, and since there exists W G Wc(diam(o)+ 2 s) containing 


W'. 


□ 


We will henceforth assume that {Z,dz) is a compact path connected metric space. 


Next we prove the main stability result. Theorem 4.8 Specihcally, in Section 4.3 we hrst 


inspect the impact of function perturbation, and we give the general result in Section 4.4 


4.3 Stability against function perturbation 

Our study of stability against function perturbation involves a reindexing of the involved 
towers of covers. 

Definition 8 (Reindexing). Let OR = {fWe a tower of covers of Z with 

r = res(2II) > 0 and cf : M"*" —)■ M"*" he a monotonically increasing function. Consider the 
tower of covers R<^(QR) := {Vg 

• Ve ■= W 0 -i(£) for each e > (fir) and 

• Ve^e' = fov E^s' such that 0(r) < e < s'. 

We refer to R<^(QIf) as a reindexed tower. 

In our case, we will use the log function to reindex a tower of covers OR. We also need 
the following dehnition in order to state the stability results. 

Definition 9. Given a tower of covers IX = {Us} and Eq > res(il), we define the Eo-truncation 
of iX as the tower Treg(il) := {Us} Observe that, by definition res(Tr£(,(il)) = eo- 

Proposition 4.5. Let X be a compact topological space, {Z,dz) be a compact path con¬ 
nected metric space, and f,g\X Z he two continuous functions such that for some 
5 > 0 one has that max^^x dzifix), gix)) = 6. Let 2R be any ic,s)-good tower of covers of 
Z. Let Eq = max(l,s). Then, the logEo-truncations o/Riog(/*(QR)) and Riog((y'*(2R)) are 
log (2cmax((5, s) + c) -interleaved. 

Proof. For notational convenience write r] := log (2c max(5, s)+c), {Ut] = if := /*(2R), and 
{Vt} = QJ := g*iW). With regards to satisfying Dehnition for Riog(il) and Riog(fR), for 
each £ > log Eq we need only exhibit maps of covers (s ■ Wexp(£) —t Vexp(£+? 7 ) and fs ■ Vexp(£) —t 
Wexp(£+r 7 )- We hrst establish the following. 

Claim 1. For all O C Z, and all 5' > 6 , f~^{0) C g~^iO^'). 

Proof. Let x G /“^(O), then dzifix),0) = 0. Thus, 

dzigix), O) < dzifix), O) + dz(g(x), fix)) < 5, 
which implies the claim. □ 
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Now, pick any t > Sq, any U G Ut, and fix 5' := niax( 5 , s). Then, there exists W G Wt 
snch that U G cc{f~^{W)). The claim implies that f~^{W) C g~^{W^'). Since QU is a (c, s)- 
good cover of the connected space Z and s < max(( 5 , s) < diam(hT^ ) < 25 ' + t, there exists 
at least one set W G Wc(25'+i) snch that C W'. This means that U is contained in 
some element of cc{g~^{W')) where W G Wc(2S'+t)- Bnt, also, since c( 25 ' + t) < c( 25 ' + l)f 
for t > Eo > 1, there exists W" G Wc(2<5'+i)t snch that W C W”. This implies that U 
is contained in some element of cc{g~^{W'')) where W" G Wc(25'+i)t- This process, when 
applied to all U G Ut, all t > £o, dehnes a map of covers C,t '-Ut ^ V(2c(S'+c)n Now, dehne for 
each e > log(eo) fhe map := Cexp(£) and notice that by constrnction this map has Wexp(e) as 
domain, and V(2Vc+c) exp(£) as codomain. A similar observation prodnces for each e > log(eo) 
a map of covers from Vexp(£) to V(2M'+c)exp(£)- 

Notice that for each £ > log(£o) one may write (2c5' + c) exp(£) = exp(£ + g). So we have 
in fact proved that log £o-trnncations of Riog (it) and Riog (2J) are r^-interleaved. □ 


An application 


of Proposition 4.5| and the resnlt in [H] 


is: 


Corollary 4.6. Let OR he any {c,s)-good tower of covers of the compact connected metric 
space Z and let f,g : X ^ Z be any two well-behaved continuous functions such that 
ma.Xx£x dzif (x), g{x)) = 6. Then, the bottleneck distance between the persistence diagrams 
DfcMM(Riog(2IJ), /) and DfcMM(Riog(2II), is at most \og{2c max(s, 5) + c) + max(0, log 2) 
for all k eN. 


Proof. We nse the notation of Proposition 4.5 Let it = /*(2II) and QJ = S'*(OR). If 
max(l,s) = s, then Riog(il) and Riog(2J) are log(2cmax(s, 5) + c)-interleaved by Proposi¬ 
tion 4.5 which gives a bonnd on the bottleneck distance of log(2cmax(s, 6 ) -L c) between the 
corresponding persistence diagrams [6]. In the case when s < 1, the bottleneck distance 
remains the same only for the 0-trnncations of Riog (it) and Riog(2J). By shifting the starting 
point of the two families to the left by at most | logs| can introdnce barcodes of lengths at 
most log - or can stretch the existing barcodes to the left by at most log - for the respective 
persistence modnles. To see this, consider the persistence modnle below where £1 = log s: 


H4iV(/*(W,J)) ^ H,(iV(/*(W,J)) ^.^ H,(iV(/*(Wo))) ^ ^ Hfc(iV(/*(W,J)) 

A homology class born at any index in the range [log s, 0) either dies at or before the 
index 0 or is mapped to a homology class of IIfc(W(/*(Wo))). In the hrst case we have a 
bar code of length at most | log s| = log In the second case, a bar code of the persistence 
modnle 


H47V(/*(W,,=o))) ^ ^ H,(iV(/*(W,J)) 

starting at index log 1 = 0 gets stretched to the left by at most |logs| = log 2. The 
same conclnsion can be drawn for the persistence modnle indnced by Riog(2J). Therefore 
the bottleneck distance between the respective persistence diagrams changes by at most 
logf □ 


Remark 4.2. The proposition and corollary above are in appearance somehow not satis¬ 
factory: imagine that f = g, then 5 = 0 but by invoking Proposition instead of a 
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^-interleaving one obtains a (log(c(2s + 1)) ma.x{0, log ^))-interleaving. Nevertheless, the 

claim of the proposition is almost tight for 5 > 0; in fact, by modifying the example in { B.l 
for each 5 > we can find a space Xs, a (c, s)-good tower of covers of a closed interval 
/ C M, and a pair of functions fs, gs ■ X ^ I such that \\fs — gsWoo = d but such that the 
bottleneck distance between the persistence diagrams of fs) and MM(it, is at least 

max{log(cs), logc, log log(c(2s + 1)) + max(0, log ^)). 


4.4 Stability in general 

We now consider the more general case which also allows changes in the tower of covers 
inducing the multiscale mapper. 


Theorem 4.7. Let il and be any two p-interleaved (c, s)-good towers of covers of the com¬ 
pact connected metric space Z and let f,g : X ^ Z be any two continuous well-behaved func¬ 
tions such that dz{f{x), g{x)) < S. Then, for Eq = max(l,s), the logEo-truncation 

o/ Riog(/*(il)) and Riog (s'* (2J)) are log (2cmax(s, 5) + c-\-p)-interleaved. 


Proof. Write and T1„(U) = {n}r>». Denote /•(») = 

<,•(«) = and /-(it) = {«/}.>„, ff-(U) = {Wh>.o. By following the argument 

and using the notation in Proposition 4.5, for each e > Eq one can hnd maps of covers 


— )■ W^2c(5'+c)£ '^^2cS'+c)£- ARot since it'f and are r^-interleaved, for each e > Eg 

there are maps of covers Uf —)■ and V| —)■ Then, for each e > Eg one can form 


the following diagram 


^ ^£(2c5'+c) ^ £{2c 5'+c)+ri ^ '^£(2c5'+c+r;) ’ 

where the last step follows because since £ > £o > ft then we have that e(2c(5' + c) + p < 
e{2c5' + c + p). Thus, by composing the maps intervening in the diagram above we have 
constructed for any e > Eg a. map of covers U[ —)■ V^( 2 c 5 '+c+)?)- ^ similar manner one can 

construct a map of covers V| —;■ ki^^^cS'+c+ri) ^ ^ ^o- 

This provides the maps ^ ^JcV+c+r,) exp(e) and ^ exp(.) each 

e > £o- Since {2c6' + c + r/) exp(£) = exp(£ + log{2c6' + c + r/)), by reindexing by log we 
obtain that Riog(/*(Tr£Q(il))) and Riog( 5 '*(Tre(,(QJ))) are log{2c5' + c + ? 7 )-interleaved. □ 

Similar to the derivation of Corollary |4.6| from Proposition |4.5[ we get the following result 
from Theorem 14.71 

Theorem 4.8. Let il and QJ be any two p-interleaved, {c,s)-good tower of covers of the 
compact path connected metric space Z and let f,g : X ^ Z be any two well-behaved 
continuous functions such that dz{f{x), g{x)) < 6. Then, the bottleneck distance 

between DfcMM(Riog(il),/) and DA;MM(Riog(QJ), ( 7 ) is bounded by log {f2cm.ax{s, 5)c + p)-\- 
max( 0 , log for all k e'H. 
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5 Exact Computation for PL-functions on simplicial 
domains 


The stability result in Theorem 4.8 further motivates us to design efficient algorithms for con¬ 
structing multiscale mapper or its approximation in practice. We justify the approximation 
algorithms using the stability results discussed above. 

One of the most common types of input in practice is a real-valued piecewise-linear (PL) 
function / : \K\ —)■ M defined on the underlying space \K\ of a simplicial complex K. That 
is, / is given at the vertex set V(iL) of K, and linearly interpolated within any other simplex 
aeK. 

In what follows, we consider this PL setting, and show that interestingly, if the input 
function satisfies a mild “minimum diameter” condition, then we can compute both mapper 
and multiscale mapper from simply the 1-skeleton (graph structure) of K. This makes the 
computation of the multiscale mapper from a PL function signihcantly faster and simpler as 
its time complexity depends on the size of the 1-skeleton of K, which is typically orders of 
magnitude smaller than the total number of simplices (such as triangles, tetrahedra, etc) in 
K. 


Given a simplicial complex K, let Ki denote the 1-skeleton of K: that is, Ki contains 
the set of vertices and edges of K. Define / : |iLi| —)■ M to be the restriction of / to |iPi|; 
that is, / is the PL function on |iPi| induced by function values at vertices. 


Condition 5.1 (Minimum diameter condition). For a given tower of covers HO of a com¬ 
pact connected metric space {Z,dz), let k(HO) := inf{diam(hP); IT G W G 211} denote the 
minimum diameter of any element of any cover of the tower HO. Given a simplicial complex 
K with a function f : \K\ —)■ Z and a tower of covers QU of the metric space Z, we say that 
{K,f,HO) satisfies the minimum diameter condition if dia.m{f [a)) < /s;(QIf) for every simplex 
aeK. 


In our case, / is a PL function, and thus satisfying the minimum diameter condition 
means that for every edge e = {u^v) G iPi, \f{u) — f{v)\ < k(QII). In what follows we 
assume that K is connected. We do not lose any generality by this assumption because the 
arguments below can be applied to each connected component of K. 

Definition 10. Two towers of simplicial complexes & = {iSg iSe'} and T = 7^/} 

are isomorphic, denoted © = T, z/res(©) = res(T), and there exist simplicial isomorphisms 
T}^ and Tj^r such that the diagram below commutes for all res(©) < e < e'. 


Se 




Vs 

% 



Our main result in this section is the following theorem which enables us to compute the 
mapper, multiscale mapper, as well as the persistence diagram for the multiscale mapper of 
a PL function / from its restriction / to the 1-skeleton of the respective simplicial complex. 
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Theorem 5.1. Given a PL function / : |i^| —)■ M and a tower of covers QU of the im¬ 
age of f with {K,f,W) satisfying the minimum diameter condition, one has MM(QU,/) = 
MM(2IJ, /). 


We show in Proposition 5.2 that the two mapper outputs M(W, /) and M(W, /) are 


identical up to a relabeling of their vertices (hence simplicially isomorphic) for every W G QU. 
Also, since the simplicial maps in the hltrations MM(QU, /) and MM(QU, /) are induced by 
the pullback of the same tower of covers QU, they are identical again up to the same relabeling 
of the vertices. This then establishes the theorem. 

In what follows, for clarity of exposition, we use X and Xi to denote the underlying 
space lA'I and \Ki \ of K and Ki, respectively. Also, we do not distinguish between a simplex 
a ^ K and its image | ct | C W and thus freely say cr C X when it actually means that | ct | C X 
for a simplex cr G X. 


Proposition 5.2. If (X,/, QU) satisfies the minimum diameter condition, then for every 
W G QU, M(>V, /) is identical to M(W, /) up to relabeling of the vertices. 


Proof. Let U = /*(W) and U = /*(W). By dehnition of /, each f7 G W is a connected 
component of some f/ fl Xi for some U E U. In Proposition 5.3, we show that U (1 Xi is 
connected for every U eU. Therefore, for every element U E U, there is a unique element 
U = 1/ n Xi in W and vice versa. Claim hnishes the proof. □ 

Claim 2. fllLi if and only if Ui 7 ^ 0. 


Proof. Clearly, if n^=i 7 ^ 0) then nf=i Ui ^ ^ because Ui C Ui. For the converse, assume 
riiLi Ui yl ^ and pick any x E fliLi Ui. Let cr C X be a simplex which contains x. Consider 
the level set L = f~^{f{x)). Since / is PL, Lg. = L fl cr is connected and contains a point 
y E a n Xi. Then, y E Ui for each i E {1,... ,k} because x and y are connected by a path 
in f/j n cr and /(x) = f{y). Therefore, c/ G Hi Ui. Since c/ G Xi it follows that 2/ G Hi Ui. □ 


Proposition 5.3. If {X,f,iIO) satisfies the minimum diameter condition, then for every 
W G QU and every U E /*(W), the set f/ fl Xi is connected. 


Proof. Fix U E /*(W). If 17nXi is not connected, let Ci,... ,Ck denote its fc > 2 connected 

components. First, we show that each Ci contains at least one vertex of Xi. Let e = {u,v) 

be any edge of Xi that intersects U. If both ends u and v lie outside U, then \f{u) — f{v)\ > 
|max[// — min; 7 /| > k(QU). But, this violates the minimum diameter condition. Thus, at 
least one vertex of e is contained in U. It immediately follows that Ci contains at least one 
vertex of Xi. 

Let A be the set of all simplices cr C X so that cr n f/ 7 ^ 0. Fix cr G A and let x be any 
point in cr n f/. 

Claim 3. There exists a point y in an edge of a so that f{x) = f{y). 

Proof. We hrst observe that V’''(cr) := {n G V(cr)|/(n) > /(x)} is non-empty. Otherwise 
f{v) < /(x) for for all vertices v E a. In that case /(x') < /(x) for all points x' E a 

contradicting that x belongs to cr. Now, if V’''(cr) = V(cr), there is an edge e C cr so that 

X G e and taking y = x serves the purpose. If V'''(cr) 7^ V(cr), there is a vertex u E V(cr) 
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so that f{u) < f{x). Taking a vertex v from the non-empty set we get an edge 

e = {u,v) where f{u) < f{x) and f{v) > f{x). Since / is PL, it follows that e has a point y 
so that f{y) = /{x). □ 

Since a contains an edge e that is intersected by U, it contains a vertex of e that is 
contained in U. This means every simplex a G A has a vertex contained in U. For each 
i = 1 ,..., fc let Aj := {a C A | V(cr) fl Cj 7 ^ 0.} Since every simplex a G A has a vertex 
contained in U, we have A = IJ. A*. We argue that the sets Ai,..., A^ are disjoint from each 
other. Otherwise, there exist i ^ j and a simplex a with a vertex u in Aj and another vertex 
V in Aj. Then, the edge {u,v) must be in U because / is PL. But, this contradicts that Ci 
and Cj are disjoint. This establishes that each Aj is disjoint from each other and hence A 
is not connected contradicting that U is connected. Therefore, our initial assumption that 
f/ n Ai is disconnected is wrong. □ 


5.1 Real-valued functions on triangulable topological spaces 

We note that, by using PL functions to approximate the continuous functions, the above 
result also leads to an approximation of the multiscale mapper for real-valued functions 
dehned on triangulable topological spaces. 

A PL function / : \K\ —)■ M 5-approximates a continuous function : A —)■ M dehned 
on a topological space A if there exists a homeomorphism h : A —)■ |A| such that for any 
point y G |A|, we have that \f{y) —go h~^{y)\ < 6. The following result states that if a 
PL function ^-approximates a continuous real-valued function on A, then the persistence 
diagrams induced by the respective multiscale mappers are also close. 


Corollary 5.4. Let f : \K\ —)■ M 6e a piecewise-linear function on K that 5-approximates a 
continuous function : A —>■ M. Let HO be a (c, s)-good tower of covers o/M. Then the bottle¬ 
neck distance between the persistence diagrams DfcMM(Riog(2IJ),/) and DfcMM(Riog(2IJ), 
is at most log ( 2 cmax(s, 5) -|- c) -|- max( 0 , log 7 ) for all k eN. 


Proof. Let g : \K\ —)■ M denote the push forward of 5 ^ : A —)• M by the homeomorphism 
K\. By the dehnition of ^-approximation, we know that ||/ — ^||cxd A Hence by 
the bottleneck distance between the persistence diagrams DfcMM(Riog(Q 2 J), /) 


4.6 


h:X 
Corollary 

and DfcMM(^Riog(2II), is at most log(2cmax(s, 5) -f c) -|- max(0, log for all A; G N. 

On the other hand, since h is homeomorphism, it is easy to verify that the pullback 
of HO via g and via g induce isomorphic persistence modules: Hfc(MM(Riog(QIl), f/))) = 
Hfc(MM(Riog(2II), ^))). Combining this with the discussion in previous paragraph, the claim 
then follows. □ 


6 Approximating multiscale mapper for general maps 

While results in the previous section concern real-valued PL-functions, we now provide a 
signihcant generalization for the case where / maps the underlying space of K into an 
arbitrary compact metric space Z. We present a “combinatorial” version of the (multiscale) 
mapper where each connected component of a pullback f~^{W) for any cover W in the 
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f-Hw) 


Figure 1: Partial thickened edges belong to the two connected components in / ^iW). Note 
that each set in ccG{f~^{W) contains only the set of vertices of a component in cc(/“^(hF)). 


cover of Z consists of only vertices of K. Hence, the construction of the Nerve complex 
for this modihed (multiscale) mapper is purely combinatorial, simpler, and more efficient to 
implement. But we lose the “exactness”, that is, in contrast with the guarantees provided 
by Theorem 5T, the simpler combinatorial mapper only approximates the actual multiscale 
mapper at the homology level. Also, it requires a (c, s)-good tower of covers of Z. One 
more caveat is that the towers of simplicial complexes arising in this case do not interleave 
in the (strong) sense of Dehnition but in a weaker sense. This limitation worsens the 
approximation result by a factor of 3. 

In what follows, cc(0) for a set O denotes the set of all path connected components of 

O. 


Given a map / : |iF| —?■ Z defined on the underlying space \K\ of a simplicial complex K, 
to construct the mapper and multiscale mapper, one needs to compute the pullback cover 
/*(W) for a cover W of the compact metric space Z. Specifically, for any IF G W one needs 
to compute the pre-image f~^{W) C \K\ and shatter it into connected components. Even 
in the setting adopted in ^ where we have a PL function / : \Ki\ —>■ M dehned on the 
1-skeleton Ki of K, the connected components in cc{f~^{W)) may contain vertices, edges, 
and also partial edges: say for an edge e G iPi, its intersection ew = e O f~^iW) C e, 
that is, f{ew) = /(e) fl W, is a partial edge. See Figure for an example. In general for 
more complex maps, a fl for any fc-simplex a may be partial triangles, tetrahedra, 

etc., which can be nuisance for computations. The combinatorial version of mapper and 
multiscale mapper sidesteps this problem by ensuring that each connected component in the 
pullback f~^(lV) consists of only vertices of K. 


6.1 Combinatorial mapper and multiscale mapper 

Let G be a graph with vertex set V(G) and edge set E(G). Suppose we are given a map 
/ : V(G) —)■ Z and a hnite open cover W = {WajaeA of the metric space {Z,dz)- For any 
Wa G W, the pre-image f~^(lVa) consists of a set of vertices which is shattered into subsets 
by the connectivity of the graph G. These subsets are taken as connected components. We 
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Algorithm 1 Combinatorial Multiscale Mapper 

Input: / : I A| -)■ Z given by /y : C (K) -)■ Z, a tower of covers 2IJ = {Wi,..., WJ 
Output: Persistence diagram D*(MM(Q2J,/)) 

for i = 1,..., f do 

compute Vw ^ where f{Vw) = fiy{K)) fl W and {Pjyjj = ccki{Vw), for 

VfP G >Vi; 

compute Nerve complex Ni = N{{V^}j^w)- 

end for 

compute D*({Aj —)■ Aj+i, i E [l,t — 1]}). 


now formalize this: 

Definition 11. Given a set of vertices O C V(G), the set of connected components of O 
induced by G, denoted by ccg{0), is the partition of O into a maximal subset of vertices 
connected in Gq ^ G, the subgraph spanned by vertices in O. We refer to each such maximal 
subset of vertices as a G-induced connected component of O. We define /*°(W), the G- 
induced pull-back via the function f, as the collection of all G-induced connected components 
ccG{f~^{Wa)) for all a E A. 

Definition 12. (G-induced multiscale mapper) Similar to the mapper construction, we define 
the G-induced mapper M'^(W,/) as the nerve complex N{f*o(yV)). 

Given a tower of covers Q2J = {We} of Z, we define the G-induced multiscale mapper 
MM‘^(2IJ,/) as the tower of G-induced nerve complexes {N{f*o(yv^)) \ W^ G QU}. 

6.2 Advantage of combinatorial multiscale mapper 

Given a map / : \K\ -E Z defined on the underlying space \K\ of a simplicial complex K, let 
fv '■ V(A) —)■ M denote the restriction of / to the vertices of K. Consider the graph Ki as 
providing connectivity information for vertices in V{K). Given any tower of covers Q2J of the 
metric space Z, the Ai-induced multiscale mapper MM^^(2n, fv) is called the combinatorial 
multiscale mapper of f w.r.t. 2IJ. 

A simple description of the computation of the combinatorial mapper is in Algorithm 
For the simple PL example in Figure f~^{W) contains two connected components, one 
consists of the set of white dots, while the other consists of the set of black dots. More 
generally, the construction of the pullback cover needs to inspect only the 1-skeleton Ki 
of K, which is typically of significantly smaller size. Furthermore, the construction of the 
Nerve complex Ni as in Algorithm is also much simpler: We simply remember, for each 
vertex v E V{K), the set ly of ids of connected components {P(y}j,weWi which contain it. 
Any subset of ly gives rise to a simplex in the Nerve complex N^. 

Let MM(2IJ, /) denote the standard multiscale mapper as introduced in ^ Our main 
result in this section is that if Q2J is a (c, s)-good tower of covers of Z, then the resulting two 
towers of simplicial complexes, MM(2IJ, /) and MM^^(2IJ, fv), interleave in a sense which is 
weaker than that of Definition]^ but still admits a bounded distance between their respective 
persistence diagrams as a consequence of the weak interleaving result of |S]. This weaker 
setting only worsens the approximation by a factor of 3. 
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Theorem 6.1. Assume that {Z,dz) is a compact and connected metric space. Given a map 
/ : \K\ Z defined on the underlying space of a simplicial complex K, let fv : V(i^) —)■ Z 
be the restriction of f to the vertex set V(i^) of K. 

Given a {c,s)-good tower of covers W of Z such that {K,f,W) satisfies the minimum 
diameter condition (cf. Gondition \5. l\ l, the bottleneck distance between the persistence dia¬ 
grams DfcMM(Riog(Q2J), /) and (Riog(2II), fv) is at most 3 log(3c) + 3 max(0, log 

for all k eN. 


is (c, s)-good, and we 


The remainder of this section is devoted to proving Theorem ED 
In what follows, the input tower of covers 2IJ = jWe —A 
set p = 3c. 

For each e > s, let denote the the nerve complex iV(/*(We)) and denote the the 
combinatorial mapper N{fy^^ (^e))- Our goal is to show that there exist maps and so 
that diagram-(A) below commutes at the homology level, which then leads to a weak logp- 
interleaving at log-scale of the persistence modules arising fr om computing the homologies 
of MM(2IJ, /) and MM^^(Q2J, fv). This then proves Theorem 


6.1 


(^) 


(B) 


(6.4) 



In the diagram above, the map 6 ^^^/ := N{f*{w^^^i)) : Mg —)■ Mg/ is the simplicial map 
induced by the pullback of the map of covers We,e' '■ kVe —?■ We'. Similarly, the map dff : = 
N{we,e’)) '■ —)■ M^^ is the simplicial map induced by the pullback of the map of 

covers tCg^g/ : Wg —)■ Wg/. 


The maps z/g. Since connectivity in the 1-skeleton Ki implies connectivity in the underly¬ 
ing space of K, there is a natural map from elements in ccxiiffi^iW)) to cc(/“^(iy)), which 
in turn leads to a simplicial map z/g : M^^ —)■ Mg. 

The details are as follows. Given a nerve complex Mg (resp. M^^), we abuse the notation 
slightly and identify a vertex in Mg with its corresponding connected component in f*{We) 
(resp. in fy'^^fWe)). 

First, note that by construction of the combinatorial mapper, there is a natural map uw : 
ccKiiffi^{W)) —)■ cc{f~^{W)) for any W E Wg: For any V E ccxiiffi^iW)), obviously, all 
vertices in V are iCi-connected in f~^(W). Hence there exists a unique set Uv E cc{f~^(W)) 
such that V C Uv- We set uw{V) = Uv for each V E ccKiiffi^i^))- The amalgamation of 
such maps for all IV E We gives rise to the map z/g : f^^^{We) —t f*{We)) whose restriction 
to each ccKi{fv^{W)) is simply uw as dehned above. Abusing notation slightly, we use 
z/g : V(M^i) —)■ V(Mg) to denote the corresponding vertex map as well. Since for any 
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set of vertices V G we have that V C Ve{V) C \K\. It then follows that non¬ 

empty intersections of sets in (hVe) imply non-empty intersections of their images via 
Ve in f*{y^e)- Hence this vertex map induces a simplicial map which we still denote by 
I/, : Mf ^ M,. 


Auxiliary maps of covers /ig. What remains is to dehne the maps in diagram-(A). 
We hrst introduce the following map of covers: /ig : We —)■ Wpe for any e > s. Specihcally, 
given any W G W^ and s > 0, let = {x e Z \ dz{x,W) < s}. Since QU is (c, s)-good, 
there exists at least one set W G Wpe such that C W' . This is because for any e > s: 

s < diam(W®) < diam(IT) -|- 2s = e -|- 2s < Se, 

so that then c ■ diam(W^) < See = pe. We set PsiW) = W'\ There may be multiple choices 
of sets in Wpe that contains W, we pick an arbitrary but hxed one. Let Sg : —>■ Mp^ 

and ts : —)■ denote the simplicial map induced by the pullbacks of the cover 

map Pe '■ We —)■ Wpe via / and via /y, respectively. In other words, s^ = N{f*{pe)) and 
ts = N{r/^{Pe)). 


The maps 0e. We now dehne the map 0^ : —>■ with the help of diagram-(B) in 

(6.4). Fix W G We. Given any set U G cc(/“^(IF)), write {Vaj/jeAy for the preimage 

of U under the vertex map Ue- Note that Vg C G for any 0 G Au and U/ 3 eAy ^ ~ V(-^) G U. 

We claim that teiVji) = teiV^i) for any 0, f3' G Au- Indeed, since Vg and Vg/ are contained 
in the path connected component G, let 7 i{x,y) C f/ be any path connecting a vertex x G Vg 
and y E Vjs/ in U. Let {ai,... ,aa} be the collection of simplices that intersect 7 i{x,y). By 
the minimum diameter condition (cf. Condition 5.1) and the dehnition of the map Pe, we 
have that /(eXj) C C Pe{W) G Wpe, for any j G {1,..., a}. 

Hence I// 3 U Vg/ U{cti, ..., exa} C Se{U) G cc{f~^{pe(W))). Furthermore, since the edges of 
simplices {cxi,..., cXq} connect x G Vg and ?/ G Vg', vertices in Vg and Vpi are thus connected 
by edges in Se{U). That is, there exists a Ayinduced component (a subset of vertices of K) 
V G ccKi{fv^{Pe{W)))) such that U Vg/ C 1/ for any 0,0' G Au. Hence, all Vgs with 
0 G Afj have the same image V in V(M^0 under the map and we set (j)e{U) := V to be 
this common image. 


Lemma 6.2. The vertex map 0^ introduced above induces a simplicial map which we also 
denote by 0£ : —)■ . 

Proof. We have already proved in the main text that for all (3 G Au, tefVy) has the same 
image V. Using the same notation as in the main text, recall that 


(*) for any 0 G Au, and IJ = V(A) n U. 

y&Au 


This implies that V(A) fl U C (feifJ). In fact, a similar argument can also be used to prove 
the following: 

Claim 4. The vertices of any simplex that intersects U will be contained in (feifJ). 
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Now to prove Lemma 6.2, we need to show the following: given a fc-simplex r = 
• iPfc} £ Me, where each vertex pj corresponds to set Uj C |iL|, then we have 
that ^ 0. 

To prove this, take any point x ^\K\ such that x G 

Suppose X is contained in a simplex a E K. By Claim the vertices of a are contained 
in (j)s{Uj), that is, V(a) C (j)e{Uj), for any j G {0,..., k}. Hence D V(a) ^ 0. 

The lemma then follows. □ 


Finally, by the construction of 0^, the lower triangle in diagram-(B) in (6.4) commutes. 
This fact, the commutativity of the square in diagram-(B), and the dehnition of together 
imply that the top triangle commutes. Furthermore, Lemma 2.1 states the two maps of 
covers /*(we,pe) and f*{pe) induce contiguous simplicial maps and s^. Similarly, 
is contiguous to ts- Since contiguous maps induce the same map at the homology level. 


it then follows that diagram-(A) in (6.4) commutes at the homology level. At this point 
one could apply the strong interleaving result of [6] had the maps 0^ been dehned for any 
Mf. —)■ M^L Nonetheless, the Weak Stability Theorem of [U] can be applied even in this case. 


This, together with an argument similar to the proof of Corollary |4.6[ provides Theorem 6.1 


7 The metric point of view 

The multiscale mapper construction from previous sections yields a tower of simplicial com¬ 
plexes, and a subsequent application of the homology functor with held coefficients provides 
a persistence module and hence a persistence diagram. 

An alternative idea is to use the data (il, / : X —)■ Z) to induce a (pseudo-)metric on X, 
and then consider the Rips or Cech hltrations arising from that metric, and in turn obtain a 
persistence diagram. Both viewpoints are valid in that they both produce topological sum¬ 
maries out of the given data. The hrst approach proceeds at the level of hltrations and then 
simplicial maps, and then persistent homology, whereas the other readily produces a metric 
on X and then follows the standard metric point of view with geometric complexes. Notice 
that these two approaches yield diherent computational problems: the MM approach leads 
to persistence under simplicial maps [15] , and the metric approach leads to persistence under 
inclusion maps, which appears to be computationally easier with state-of-the-art techniques. 

7.1 The pull-back pseudo-metric 

Assume that a continuous function f : X ^ Z and a tower of covers il with res(il) > 0 of 
Z are given. We wish to interpret e as the size of each U & Ue at least for e > s when il is 
a (c, s)-good tower of covers. We dehne the function dixj : X x X —)■ IR+ for x,x' G X s.t. 
X ^ x' hy 

dix,f{x,x') := inf {e: > 0| 3V G f*{Ue) with x, x' G R}, and d^j{x,x) := 0 for all x G X. 

(7.5) 

Notice that dixj{x,x') > s whenever x 7 ^ x'. We refer to duj as the pull-back pseudo-metric. 

Lemma 7.1. If Zi is a {c,s)-good tower of covers of the compact connected metric space Z, 
then dixj satisfies: 
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• dixj{x,x') = dixj{x',x) for all x,x' G X. 

• dixj{x, x') < c ■ [diij{x, x”) + dixj{x", x') + 2s) for all x, x', x” G X. 

Proof of Lemma \7.1\ That the dehnition yields a symmetric function is clear. The second 
claim follows from the c-goodness of if and Proposition |4.4[ Indeed, assume that Vi G /*(f4i) 
is s.t. x^x" G Vi and V 2 G f*{Ue 2 ) is s.t. x'\x' G V 2 ■ Then there exists Ui G s.t. 
Vi G cc(/“^(f/i)), and there exists 1/2 G s.t. V 2 G cc(f~^(U 2 )). Clearly, Pi fl V 2 7 ^ 0, 

implying f/i fl f /2 7 ^ 0- Thus diam(f/i U U 2 ) < £1 + £ 2 - The proof concludes via Proposition 

an □ 

Remark 7.1. Note that despite the fact that we call d^j a pseudo-metric, it does not satisfy 
the triangle inequality in a strict sense. According to the lemma above it does, however, 
satisfy a relaxed version of such inequality. Furthermore, in the ideal case when c = 1 and 
s = 0, dixj satisfies the triangle inequality. Thus, in what follows we will take the liberty 
of calling it a pseudo-metric. Furthermore, this relaxed triangle inequality does not preclude 
our ability to consider the induced Cech complex and to develop the interleaving results in 
Section 


7.2 Stability of d\xj 

In the same manner that we established the stability of the multiscale mapper construction, 
one can answer what is the stability picture for the Cech construction over (X, dn,/)- A hrst 
step is to understand how d^j changes when we alter both the function / and the tower if. 

Proposition 7.2 (Stability of pull-back pseudo-metric under cover perturbations). Let f : 
X ^ Z be a continuous function and if and QJ be two rj-interleaved towers of covers of Z 
with res(il) > 0 and res(2J) > 0. Then, 


G X, dnj{x,x') < d<uj{x,F) +1]. 

Proof. Assume the hypothesis. Then, by Lemma 4. Hr (if) and /*( 2 J) are //-interleaved as 
well. Pick x,x' G X and assume that dixj{x,x') < e. Let U G f*{lde) be such that x,x' G Lf. 
Then, one can hnd V G V^+ri such that U C V. Then, since x, x' G V, it follows that 

d^j{x,x') <eFr]. 


Since this holds for any e > dixj{x,x'), we obtain that dixj{,x,x') < d<sj{x,x') + 7 ]. □ 

Proposition 7.3 (Stability of pull-back pseudo-metric against function perturbation). Letii 
be a (c, s)-good tower of covers of the compact connected metric space Z and let f,g : X ^ Z 
be two continuous functions such that for some 5 > 0 one has ma.x,cex dz{f{x), g{x)) < 6. 
Then, 

\/x,x' G X, dix,g{.x,x) < c[dixj{x,x') -|- 2max((5, s)). 

Proof. Write if = {Ud\. Let duj{x,x') < e. 

Then there exists U and V G cc{f~^{U)) such that x,x' G V. Write 6 ' = max(h, s). 
We know f~^{U) C g~^(U^'). Since s < diam(17'^') < e + 26' and if is a (c,s)-good tower of 


22 











covers of Z, there exists U' G Uc{e+ 2 S') so that U^' C U'. Thus, V C g~^([/'). It follows by 
dehnition that dix^g{x, x') < c - {e + 25'). Since the argument holds for all £ > 0, we have the 
result. □ 

As a corollary of the two preceding propositions we obtain the following statement: 

Proposition 7.4. Let eo,c > 1 and ri,s > 0. Let il and be any two rj-interleaved, Sq- 
truneations of {c,s)-good towers of eovers of the compact connected metric space Z and let 
f,g:X^Z be any two continuous functions such that max^i^x dz{f (x), g{x)) < 6. Then, 
for allx,x' G X 'y~^-dss,f{x,x') < dix,g{x,x') < 'y-d‘i]j{x, x'), wherej := (2cmax{s, 5)+c+g). 


7.3 Cech filtrations using d^j 

Given any point x E X, we denote the e-ball around x by B^x) = {x' G X | dixj{x,x') < e}. 
We have the following observation. 

Lemma 7.5. Let f : X ^ Z be continuous and il any tower of covers of Z with res(il) > 0. 
Consider the pseudo-metric space {X, dixj). Then, for any x E X and e > 0, 

B,{x) = U 

v&f*(u,),s.t. X eV 


Proof. By dehnition, B^{x) = lj< 5 <e{^ ^ f*(p(s)] x E V}. Now, since the tower il is hierar¬ 
chical (cf. Dehnition]^, then whenever 5 < e and V E f*{Lls) one will also have that there 
exists V' E f*{U^) with V C V'. The conclusion follows. □ 


The fact that as dehned d^j is a non-negative symmetric function on X x X permits 
dehning the Cech (and also Rips) hltration on the powerset pow(X) of X. The Oech filtration 
Cech(il, /) induced by data (il, / : X —)■ Z), is given by the function Fc ■ pow(X) —)■ M where 
for each cr C X, Fc{cr) := inCgx sup 3 ,/g(,. (inj(a;, x'). Specihcally, Cech(il, /) can be written 

as {Cg C£/}s<£<£/ where := {a \ Fc{(t) < e}, and Ls^e' are the natural inclusion maps. 
Note that a = {a:o) • • •, 3:^} G if and only if f). B^Xi) ^ 0. 


Corollary 7.6. Under the hypothesis of Theorem l.f, Cech(Riog(il), /) and Cech(Riog(2J), 
are log (2cmax(s, 6 ) c-\- rj) -interleaved. 


Because of the above corollary, the persistence diagrams arising from the Cech hltration 
on X are stable in the bottleneck distance. 


7.4 An interleaving between MM(il,/) and Cech(it,/) 

Interestingly, it turns out that the two views: the pullback of covers and an induced pullback 
metric, are closely related. Specihcally, by putting together the elements discussed in this 
section we can state a theorem specifying a comparability between MM(il, /) and Cech(il, /) 
under the log-reindexing. 
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Theorem 7.7. Let {Z,dz) be a compact connected metric space, iX be a {c,s)-good tower of 
covers of Z with s > 1, and f : X ^ Z be continuous. Then, the hierarchical families of 
simplicial complexes MM(Riogil, /) and Cech(Riogil, /) are log(c(s + 2 ))-interleaved. 

Corollary 7.8. Under the hypotheses of the previous theorem, the persistence diagrams of 
MM(Riogil,/) and Cech(Riogil,/) are at bottleneck distance bounded by log(c(s + 2 )). 


Proof of Theorem 7^. To fix notation, write it = {U^}, MM(il,/) = {M^ M£/}s<e<e/, 

and Cech(it, /) = {Cg C£/}s<£<£' for the mnltiscale mapper and Cech filtrations, respec¬ 
tively. Recall that a = {a:o) • • • > if and only if ^ 0. 

For each e > s we will define simplicial maps tt^ : —)■ Mc(s+ 2 )£ and —)■ so 

that each of the diagrams below commntes np to contignity: 




(7.8) 


(7.9) 


Recall that the vertex set of is X, whereas the vertex set of is the cover f*{U^). 


The map vr^ : —)■ Mc(<j+ 2 )£- Consider the map : X —>■ f*{Uc{s+ 2 )e)i where x eqnals 

to an arbitrary but fixed 14 £ f*{hlc{s+ 2 )e) such that B^[x) C I 4 . Such a I 4 always ex¬ 
ists. Indeed, note first that f{Bs{x)) = {U G Wg|/(x) G U}. Then, it follows that 
diam(/(Rg(a:))) < 2e. Now, invoking the fact that il is (c, s)-good and Proposition 4.4, and 
noting that c(2e: -i s) < c(s -i 2)e for e > s > 1, we conclude the existence of U E Uc{s+ 2 )e 
such that /(i?g(x)) C U. Finally, pick I 4 G cc(/“^(17)) s.t. B^{x) C 14- 

The map induces the simplicial map vTg : Cg —)> Mc(g+ 2 )g for any e > s, as used in 
diagrams (7.6), (7.7) and (7.8). Indeed, assume a = {xq, ... ,Xk} G Cg. Then, f].B^{xi) 7 ^ 0. 
Since by construction ^g(xj) = 14i ^ B^{xi) for all i, it follows that 4 0 ^-s well. 
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The map —)■ C^. For all V G f*{Us) pick a point xy ^ V: This is a choice of a 

representative for each element of the pullback cover. Dehne = xy- We now check 

that this vertex map induces a simplicial map 'ip;,. Assume that Vq, Vi,..., 14 ^ /^(We) cire 


s.t. f]. Vi 7 ^ 0. Note that by Lemma 7.5, each Vi satishes Vi C B^{xyP). It then follows that 


(4j B^{xyP) 7 ^ 0, implying that xy ^,..., xv*. span a simplex in C^. 


Claim 5. The maps Se,c{s+ 2 )e tt^ o in (7.6) are eontiguous. 


Proof. Assume that I 4 , 14, • ■ ■, 14 £ are such that fljVi 7 ^ 0, and let Ui := Ss^c(s+ 2 )e{Vi) 

for each i. Since Vi C Ui, we have that 51*77 4 rijf/j. On the other hand, let xy^ := 'ipeiVi) (i.e, 
xy^ is the representative of the set Vi). Note that Wi := 7r£(xv'4 satishes IF* 4 B^{xy() 4 14- 
We then have that 0*14 4 0*74 fl OjWj. Thus 0*17* P| PjWj 7 ^ 0, implying that {17*} U {IT*} 
spans a simplex in Mc(s+ 2 )£- Hence Se^c(s+ 2 )e and tt^ o are contiguous. □ 


Claim 6. The maps Le,c{s+ 2 )£ and ipc{s+ 2 )s ° T^e in (7.1) are eontiguous. 


Proof. Specihcally, consider a = {xq, ... ,Xk} G Cg, and let I 4 := 7rg(x*) for each i G 
{0,..., k}. By dehnition of we have that B^[xi) C I 4 for i G {0,..., k}. 

Let X* := 4 c(s+ 2)£(14) for each i G {0,... ,k}. Then, since 14 belongs to f*{lAc{s+ 2 )e)) by 
dehnition of x* and Lemma 7.5, we see that 14 C Bc(s+ 2 )e{.Xi), for each i G {0,..., 4} , and 
thus 

Hg(x*) C Hc(g+2)£(7*) for i G {0,..., k}. 

On the other hand one trivially has B^{xi) C Bc(s+ 2 )e{a:i) for z G {0,..., k}, so that then 
Hg(x*) C Hc(g+2)£(7*) O Bc(s+ 2 )s{xi) for z G {0,..., k}. Thus, 

0 7 ^ 0*7?g(x*) 4 7?*;(g_|_2)g(Xj) P B(,(^g_i_ 2 'js{iii) ■ 

i 

Hence vertices in te,c(s+ 2 )e(o') P'ipc{s+ 2 )e°'^e{o') span a simplex in Cc(s+ 2 )e establishing that 
the two maps ie,c(s+ 2 )e and 'ipc{s+ 2 )e ° ^Tg are contiguous. □ 

The following two claims are also true. Their proofs use arguments similar to the ones 
given in the proof of Claims and above and are ommitted. 


Claim 7. The maps Tie' ° and Sc{s+ 2 )e,c(s+ 2 .)s' ° in (7.8) are eontiguous. 


Claim 8. The maps z-g^g/ o xp^ and 4e' o z-g^g/ in (7.9) are contiguous 


The rest of the proof of Theorem |7.7| follows steps similar to those in the proof of Propo¬ 
sition 14.51 □ 


8 Discussion 

In this paper, we proposed, as well as studied theoretical and computational aspects of 
multiscale mapper, a construction which produces a multiscale summary of a map on a 
domain using a cover of the codomain at different scales. 

Given that in practice, hidden domains are often approximated by a set of discrete 
point samples, an important future direction will be to investigate how to approximate the 
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multiscale mapper of a map f : X ^ Z on the metric space X from a finite set of samples 
P lying on or around X as well as function values / : P —)■ Z at points in P. It will 
be particularly interesting to be able to handle noise both in point samples P and in the 
observed values of the function /. 

We also note that the multiscale mapper framework can be potentially extended to a 
zigzag-tower of covers, which will further increase the information encoded in the resulting 
summary. It will be interesting to study the theoretical properties of such a zigzag version 
of mapper. Its stability however appears challenging, given that the theory of stability 
of zigzag persistence modules is much less developed than that for standard persistence 
modules. Another interesting question is to understand the continuous object that the 
mapper converges to as the scale of the cover tends to zero. In particular, does the mapper 
converge to the Reeb space [IS] ? 

Finally, it seems of interest to understand the features of a topological space which are 
captured by Multiscale Mapper and its variants. Some related work in this direction in the 
context of Reeb graphs has been reported recently Haim. 
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A The instability of Mapper 

In this section we briefly discuss how one may perceive that the simplicial complexes produced 
by Mapper may not admit a simple notion of stability. 

As an example consider the situation in Figure]^ Gonsider for each 5 > 0 the domain Xs 
shown in the hgure (a topological graph with one loop), and the functions depicted in hgure 
these are height functions fs and gs which differ by 5, that is \\fs — S'^Hoo = d. The open 
cover is shown in the middle of the hgure. Notice that the Mapper outputs for these two 
functions w.r.t. the same open cover of the co-domain M are different and that the situation 
can be replicated for each 5 > 0. 

In contrast, one of the features of the Multiscale Mapper construction is that it is 
amenable to a certain type of stability under changes in the function and in the tower 
of covers. This situation is not surprising and is reminiscent of the pattern arising when 
comparing standard homology (and Betti numbers) computed on hxed simplicial complexes 
to persistent homology (and Betti barcodes/persistent diagrams) on tower of simplicial com¬ 
plexes (i.e. hltrations). 

In what follows we will introduce a particular class of towers of covers that will be used 
to express some stability properties enjoyed by Multiscale Mapper. 


B Good towers of covers 

Remark B.l. We make the following remarks about the notion of {c,s)-good towers of 
covers: 

• The underlying intuition is that an “ideal” tower of covers is one for which c = 1 and 
s = 0. 

• A first example is given by the following construction: pick any s G [0, diam(Z)] and 
for e > s let := {i?(z, |), 2 ; G Z}, i.e. the collection of all e-balls in Z. The 
maps Ws^s' for e' > e are defined in the obvious way: B{z, |) h-)■ B{z, |-), all z E Z. 
Then, since any O <Z Z is contained in some ball i?(o, diam(O)) for some o E O, this 
means that {lAe}e>s is {2,s)-good. Of course whenever Z is path connected and not a 
singleton, this tower of covers is infinite. A related finite construction is given next. 
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Figure 2: A situation in which Mapper yields two different answers for similar functions on 
the same domain. The construction can be carried out for each 5 > 0. 


A discrete set P C Z is called an i/-sample of (Z, dz) if for any point z & Z, dz{z, P) < 
v. Given a finite u-sampling P of Z one can always find a (3, 21/) -good tower of covers 
of Z consisting, for each scale e > 2i/, of all balls B{p, ^), p & P■ Details are given in 
Appendix \B.^ This family may not he space efficient. An example of how to construct 
a space efficient {c,s)-good tower of covers for a compact metric space {Z,dz) can be 
found in Appendix\B.^ 


• Conditions (1) and (2) mean that the tower begins at resolution s, and the resolution 
parameter e controls the geometric characteristics of the elements of the cover. Param¬ 
eter s is clearly related to the size/finiteness o/il.- if s is very small, then the number 
of sets in Us for e larger than but close to s has to be large. Also, reguiring s to be 
smaller than or egual to the diameter of Z ensures that eondition 3 is not vaeuously 
satisfied. 


• Condition (3) controls the degree to which one ean inject a given set in Z inside an 
element of the eover. The situation when c> 1 is eonsistent with having finite covers 
for all e. On the other hand, c = 1 may reguire, for each e, open covers with infinitely 
many elements. 


• If ii is (c, s)-good, then it is (c', s')-good for all d >c and s' > s. 


B.l The necessity of (c, s)-good tower of covers. 

In Section 


.2 


we aim to obtain a stability result for the multiscale mapper MM(il, /) with 
respect to perturbations of /. First, notice that we need to bound the resolution of an tower 
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of covers from below by a parameter s > 0 to ensure its finiteness. We explain now why we 
need the second parameter c for the stability result. 

Specihcally, in what follows, we provide an example of a tower of covers of M which 
does not satisfy the condition 3 in (c, s)-goodness. This causes the persistence diagram 
DiMM(il, /) to be unstable with respect to small perturbations of /. 


Construction of a pathological tower of covers. 

Given M > 0 let I := [—M, M]. For each e > 0 and A; G Z let := ^ (fc + l) . 2 l-*°S 2 J]. 

Pick any s > 0 and consider the following tower Q2J of covers of I by closed intervals: 


where 


2IJ= {W, 


s<s<e'^ 


>V, := I fceZs.t. 4"^nl^0}. 


The maps of covers We,e' '■ hVe —)■ We' are dehned below. 


Claim 9. For any e' > e, and each element G We, there exists a unique fc' G Z such 
that C 

For each /c G Z we set We^e'i^k b = iF where A:' G Z is given the claim above. 

Proof. (Proof of the claim) Write q = [log^J and q' = [log| \ = q + n for some n G N. We 
are going to produce A:' G Z such that k' -2^' < k ■ 2'^ and {k + 1) ■ 2'^ < {k' + 1) ■ 2^', which 
will immediately imply that C Iy\ 

Choose k' to be the largest integer such that k' ■ 2“^ < k ■ 2^. This means that A:' G Z is 
maximal amongst integers for which A:' ■ 2” < k. By maximality, we have that {k' + 1) ■ 2"’ > 
A: + 1. Then, this means that {k' +1) ■ 2'?' > (A: +1) ■ 2'^, which implies that indeed C Iy\ 
In fact, ^ is the unique interval in We' that contains due to the fact that intervals 
in We' have disjoint interiors. □ 


Remark B.2. By the argument in the proof of the above claim, it also follows that for any 
S < e < s' < e" and any I G We, We^e"{I) = 'U}e',e"{'U)e,e' {!))', that is, We,e" = We'^e" ° 
Hence the maps {we,e' ■ We —)■ We'}s<e<£' valid maps of covers, satisfying the conditions 

in Definition^ Hence W is a tower of (closed) covers. 


Remark B.3. This natural tower of covers W is not (c, s)-good for any constant c. Specifi¬ 
cally, consider an arbitrary small interval (—r, r) for any r > 0. Clearly, there is no element 
in any We that contains this interval. It turns out that the persistence diagram arising from 
multiscale mapper is unstable w.r.t. perturbations of the input function, as we will show by 
an example shortly. Note that, in contrast, stability of the persistence diagrams of multiscale 
mapper outputs is guaranteed for (c, s)-good tower of covers, as stated in Corollary 4.6 and 
Theorem\4.2 ■ 
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Figure 3: / and g. 


An example. 

We show how instability of multiscale mapper may arise for some choices of 2IJ using the 
following example which is similar to the one presented in Figure Let 5 be any value larger 
than s, the smallest scale of the tower of covers 2IJ. Suppose we are given two functions 
dehned on a graph G\ /, : G —)• M, as shown in Figure It is clear that ||/ — (71100 = 

DiMM(Q2J,/) consists of the point (s,2(5), indicating that a non-null homologous loop 
exists at scale s in the pullback nerve complex N{f*{yVs)), but is killed at scale 26 in the 
nerve complex N{f*{yV 25 )) (specihcally, the image of this loop under the simplicial map 
f*{Ws,25) becomes null-homologous). 

However, for the function g, there is a loop created at the lowest scale s, but the homology 
class carried by this loop is never killed under the simplicial maps g*{ws^e') '■ N{g*{yVs)) —)■ 
N{g*{W^i) for any e' > s. Thus the persistence diagram DiMM(2IJ, consists of the point 
(s, oo). Hence the two persistence diagrams DiMM(Q2J,/) and DiMM(QIf, ^f) are not close 
under the bottleneck distance (in fact, the bottleneck distance between them is oo), despite 
the fact that the functions / and g are 5-close , that is ||/ — 5'||oo < 5. 

Remark B.4. We remark that for clarity of presentation, in the above construction, each 
element in the covering We is a closed interval. However, this example can be easily extended 
to open covers: Specifically, let v be a sufficiently small positive value such that u < s < 6. 
Then we can change each interval I = [k ■ {k + 1) ■ 2 l-^°® 2 J] g ^ ■ 2l-^°®2-l — 

u, (fc -(-1) ■ 2 l-^°® 2 J -|- i,y Note that for an arbitrary small interval (—r, r) with r > u, there is 
no element in any Wg that contains it, so that the resulting tower of covers is not (c, s)-good 
for any c > 1. Hence the example described in Figure^ can be adapted whenever 6 > v; this 
leads to the following statement: 

Proposition B.l. Fix s > 0 and W > s. Then, for each ^ > 5 > 0 there exist (1) a 
topological graph Gs, (2) two continuous functions fs,gs ■ Gs —)■ [—PF, PF], and (3) a tower 
of covers QU of [—PF, PF] satisfying the following properties: 

• 11/5 - = d. 


31 









DiMM(2n, fs) = {(s, 25)} and DiMM(2IJ, gs) = {(s, cx))}. 


B.2 Constructing a good towers of covers. 

Suppose we are given a compact metric space {Z, dz) with bounded doubling dimension. We 
assume that we can obtain an u-sample P of {Z, dz), which is a discrete set of points P C Z 
such that for any point z E Z, dz{z,P) < z/. For example, if the input metric space Z is 
a d-dimensional cube in the Euclidean space we can simply choose P to be the set of 
vertices from a d-dimensional lattice with edge length u. For simplicity, assume that z/ < 1 
(otherwise, we can rescale the metric to make this hold). 

B.2.1 A simple (c, s)-good tower of covers. 

Consider the following tower of covers 2IJ = {We | e: > 2z/} where W^ := {B|(m) | u G P}. 

The associated maps of covers : We —)■ We' simply sends each element B|(m) G We to 

the corresponding set Bp{u) G We'. It is easy to see that QU is (3,2z/)-good. Indeed, given 

2 

any O P Z with diameter R = diam(O) > s = 2z/, pick an arbitrary point o E O and let 
u E P he Si nearest neighbor of o in P. We then have that for any point x E O, 

3R 

dz{x, u) < dz{x, o) -1- dz{o, u) < R + v < —. 


That is, O C E Wsr. 

This tower of covers however has large size. In particular, as the scale e becomes large, 
the number of elements in Wg remains the same, while intuitively, a much smaller subset 
will be sufficient to cover Z. In what follows, we describe a different construction of a good 
tower of covers based on using the so-called nets. 


B.2.2 A space-efRcient (c, s)-good tower of covers. 

Following the notations used by Har-Peled and Mendel in [21] , we have the following: 

Proposition B.2 (|2I])- For any constant p > 11, and for any scale i E M"*", one can 
compute a p^-net f/{i) P P in the sense that 

(i) for any pE P, dz{p,Af{i)) < p^; 

(ii) any two points u,v E A5(£), dz{u,v) > p^~^/lQ, and 

(Hi) AT^f) C Af{£) for any i < £'; that is, the net at a bigger scale is a subset of net at a 
smaller scale. 


Each Af{i) is referred to as a net at scale i. 


From now on, we set Si := 4(p -|- 1)* for any positive integer i E Z~^. Consider the 


collection of nets {A/'(i) | i E Z+} where A/'(i) is as described in Proposition B.2 
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Definition 13. We define a tower of covers il = {U^. \ i G Z"*"} where: 

t4i := I M G A/'(i)}, where Si = A{p + . (B.IO) 

The associated maps of covers «£;,£■ ; , for any i,j G with i 7 ^ j, are defined 

as follows: 

(1) : We- —> For any element U = B^(-u) G Wg., we find the nearest neighbor 

V G Af(i + 1) of u in Af(i), and map U to V = Be^(t>) G Wg,,. . 

(2) For i < j — 1, we set o,s the concatenation o Uej- 2 ,ej-i o ■ ■ ■ o 

Remark B.5. We note that the above tower of covers il is discrete in the sense that we 
only consider covers Ue for a discrete set of es from {sj | i G Z+}. However, one can easily 
extend it to a tower of covers it®""* = which is defined for all 6 G M+ by declaring that 

where ({ 6 ) = max{z G Z| e* < 5}. One may define the cover maps in il®^* in 
a similar manner. For simplicity of exposition, in what follows we use a discrete tower of 
covers. 


Claim 10. For any i G Z"*", U^. forms a covering of the metric space Z (where P are sampled 
from). 


Proof. P is an i/-sampling of {Z, dz) with u < 1. Hence for any point x & Z, there is a point 
Px & P snch that d{x,Px) < s. By Property (i) of Definition B.2, dz{Px,d^(,i)) < ph Hence 
dz{x,N'{i)) < dz{x,Px) + dz{Px,d\f (i)) < 1 + p* < (p + 1)*. As such, there exists u G A/'(i) 
such that dz{x,u) < (p + 1)*; that is, x G H = B^(m) G Wg.. □ 

Claim 11. For any i < j and any U G W^-, we have U C Us^,£fiU). 

Proof. We show that the claim holds true for j = i + 1, and the claim then follows from this 
and the construction of «£;,£ for i < j — 1. 


Suppose U = B£i(M) for M G Nii), and let V = 


u. 


'£i,£i+l 


(U). By construction, H = B^i+i (v) 


is such that v is the nearest neighbor of u in f/{i + 1). li u = v, then clearly U FV. 

li u y V, by Property (i) of Definition B.2, dz{u,v) < At the same time, any point 
X E U is within ^ = 2(p + 1)* distance to u. Hence 


dz{x, v) < dz(x, u) + dz{u, v) < 2(p + 1)* + < 2 (p + 1) 


\i+l 


It then follows that P C V. □ 

Theorem B.3. il as constructed above is a (c, s)-good tower of covers with c = s = 4(p+1). 


Proof. First, note that by Claim 10, each W£ G il is indeed a cover for {Z,dz). By Claim 


11 the associated maps as constructed in Definition 13 are valid maps of covers. Hence .il is 


indeed a tower of covers for {Z, dz). We now show that il is (c, s)-good. 

First, by Eqn. |B.10 each set U in obviously has diameter at most £*. Also, it is s- 
truncated for s = 4(p+ 1) since it starts with Si = s. Hence properties 1 and 2 of Definition 
hold. What remains is to show that property 3 of Definition also holds. 
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Specifically, consider any O C Z such that diani(O) > s. Set R = diam(O) and let a be 
the unique integer such that 


(p+l)““^ <R< (p + l)“. 


Let o E O he any point in O, and p E P the nearest neighbor of o in P: By the sampling 
condition of P, we have dz{o,p) < u < 1. Let u E Af{a) be the nearest neighbor of p in the 
net Af{a). By Dehnition B.2 (i), dz{p,u) < p°‘. We thus have that, for any point x E O, 


dz{x,u) < dz{x,o) + dz{o,p) + dz{p,u) 
<R + e + p‘^ 

< (p + 1)“ + 1 + 

<2(p+l)L 


In other words, O C B 2 (p+i)a{u) = B^(m) G 

On the other hand, since R> {p + 1)“”^, we have that for c = 4(p + 1), 

Ea = 4(p + 1)“ = c • (p + 1)““^ < cR = c - diam(O). 


□ 


Space-efficiency of it. 

In comparison with the simple (3,2p)-good tower of covers QU that we introduced at the 
beginning of this section, the main advantage of it is its much more compact size. Intuitively, 
this comes from property (ii) of Proposition [I^ which states that the points in the net Af{i) 
are sparse and contain little redundancy. In fact, consider the covering at scale e*. It 
size (i.e, the cardinality of is close to optimal in the following sense: 

For any e, denote by V*(e) the smallest possible (in terms of cardinality) covering of 
{Z,dz) such that each element V E V*(£) has diameter at most £. Now, let s*(£) := |V*(£)| 
denote this optimal size for any covering of {Z, dz) by elements with diameter at most e. 


Proposition B. 4. For any i E = |A/'(i)| < 


Proof. The left inequality follows from the dehnition of We now prove the right 

By 


inequality. Specihcally, consider the smallest covering V* = V*(^) with = |V 

each set V E V* can contain at most one point from 7V(i 


property (ii) of Proposition 13 


since the diameter of V is at most ff-. At the same time, we know that the union of all sets 

16p ’ 


in V* will cover all points in A/'(f). The right inequality then follows. 


□ 
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